SEQUENCE LISTING



<110> Galilaeus Oy

<120> Gene cluster involved in nogalamycin biosynthesis, and its use in
      production of antibiotics

<160> 18

<210> 1
<211> 16020
<212> DNA
<213> Streptomyces nogalater ATCC 27451

<220>
<221> misc_feature
<222> 3799..38D0
<223> "overlapping sequence in the genes snoaH and snogN"

<221> misc_feature
<222> 6334..6356
<223> "overlapping sequence in the genes snoaG and snogC"

<221> misc_feature
<222> 13201..13300
<223> "unknown region"

<400> 1 

agatctcgtc cgccagtgcc tcagtgaccg gcaacgagcc cttggcgtag ccgagatggg    60
agaaaccggt catggtgtgc acgagccagg gataactgat gttgagggcg atgtcgtagg    120
aggcgcgcag ggcctccagc accgcgtccc gtcgcggatg gcgcaccacg tacacgtagt    180
agacgtgctc gttgccctgc gcggtcctcg gcagcagcag ccccgtgtcc gccaggccct    240
cctcatagcg gcgtgccacc gcccggcggg cctcgatgta ggacggcaac cgggacagct    300
tgcgccgcag gatctctgcc tgtacttcgt ccagccggct gttgtgcccg ggggtttcga    360
cgacgtagta gcggctctcc atgccgtagt agcgcagccg ccgcagccgg tccgccaccc    420
gctcgtcgtc ggtgagcacc gcgccgccgt ccccgtacgc gcccagcacc ttggtcgggt    480
agaaggagaa cgcggccgcg tcaccggtcg agccggcgag tcggccgtgc cggcgcgccc    540
cgtgcgcctg cgcgcagtcc tccaggatca ccaggttgtg ccgggcggcc agatcgcgca    600
gcggtgccat gtccacgcac tgcccgtaga ggtagaccgg cagcagacac cgggtgcgtg    660
gcgtgaggac ggcctccacc tgggacgtgt ccatcaggta gtcctcctcg cgcacgtcca    720
cgaagacggg cgtggcaccg gccgagtcga tcgcgacgac cgtgggcgcg gcggtgttgg    780
acacggtgac gacctcgtcg ccgggcccga cacccaaggc ctgtaacccc agcttgacgg    840
cgttggtccc gttgtcgacg ccgacggcat gtccgac^c ctggaatgag gcgaactcgg    900
actcgaagcc gcgcacgctc tcaccgagga cgagccggcc ggagcggaac accgtctcca    960
cggcatcgtg gatgtcctcg cgttccagct cgtattccgg cagatagtcc cacacgtgta    1020
cggtcatcga gcccctccgg gattctccct gcgaaaagtc ^cactctac gacaacgttc    1080
accactcgct ttttcctcaa cgtccgcttg agacggcccg gcctgctgtg gcccggggaa    1140
aggtgcggtc


gttatcatcg 


actecgtett 


ctcatt cgga 


ggttgttc ag 


ggt gaaggga 


1 o or* 


at cat tc teg 


ccgggggt ac 


ggggagcagg 


^ ^ ^ #^ O 

CX.wCa.CCC.gd 


cgactctcgc 


gytytt'Cddg 


X ^ D 


cagctt ctcc 


cegt cgggga 


eaagccgatg 


o.Ti^t-.wdCX.dC.Lr 


cgct C t CCy t 


gctgatgctg 


J. J ^ 


gccggcgtca 


cggaeatcct 


^ 4* ^ ^ ^ ^ 

Caucaucagc 


dCdU-l^gCdL^Cj 


a ^ ^ ^ ^ ^ ^ ^ 

ddttCCCCCg 


ddtytytCyt 


1 "^Rfl 

J. ^ OVJ 


ctgttcggcg 


acggcgcaca 


gc u Cy gacT*c 


z*' ^ *^ f 4- 


dtgttgdgtd 


gydydddCtt 


JL ** •* U 


aggggtatcg 


cegaggcgt t 


ccugaucggu 


f^f^ a a 
googdL.LodL.^ 


tgggddgtgd 


tyttgttgt-y 




ctggcgctgg 


gegacaacat 


att ccacggg 


agt. uc u u u uc 


aggggg tgct 


gcgcddggaa 


J. 3 DU 


gccgaggaat 


tggaegggtg 


tgt cc ugn uc 


ggu uduccgg 


t caaggatcc 


ccagcgt t at 


J. u 


ggagtcggcg 


aggegaacgc 


gt ccgggcgg 


^ ^ ^ ?a 

CuCgt.Ct.CCd 


tcgaggagaa 


accggtacgc 


J.OOU 


ccccgct cca 


accgggccat 


caceggacte 


tattt etacg 


acaacgaggt 


g gt gg a e at e 




gcccggcggc 


tgcgcccct c 


cgcecgeggc 


gaaetegaaa 


teaccgacat 


ca a cegt ace 


XoUU 


tacatggaac 


gaggccgggc 


eeggct cgtg 


gacctgggcc 


ggggatt cge 


etggc t egac 


1 n^rt 
Xoou 


accggcacac 


ccgagtcact 


cctgcaggcc 


t cgcagt acg 


tgtccgccct 


ggaggaacgc 


1 Q o o 


cagggcatca 


ggatcgcctg 


cat egaggag 


gtggccct cc 


gcat gggett 


catcaaegce 


xy c5u 


caggcctgct 


acgaactggg 


cgcgcgeetg 


t ccggctccg 


get acgggca 


gtacgtgatg 




gc cat eg egg 


aggagtgcae 


ggggcgggtg 


tgagcggccg 


t gccggyugg 


gcgaacggec 


o 1 nn 


cggcctt acc 


cggecccgcg 


cacecegaeg 


ddCddCCCCC 


gg ccggtcag 


cccg t cgtc c 


Z X DU 


aggaact egg 


cegggcagec 


cgcgt cet eg 


ddcgcggcgd 


/^i^4* s 4*" ^ ^ 4~ ^ 

gg t dctcc tc 


cctggtgddC 


Z Z 4^ u 


agggtgagca 


ggt egat ct c 


cgtg aac teg 


cgtaucccgg 


tggcctcgcc 


gaecaggaac 


o o p n 
zz ou 


cgcaeetcca 


tgcgggtcct 


geggccctgc 


c t ggtggagt 


gggacacecg 


ggccacggt c 




cggccct cac 


cgcgtgccag 


guccccggcg 


d c gt dgc c 1 1 


CCdgyddCCy 


4* ^Y ^Y ^Y ^ ^ 

c t cggggaac 


o Ann 


caccagggct 


c cac cac gag 


cacgccgccc 


ggcdCCdggt 


gcgcggc cat 


cgtgcgcacc 


o A 

Z 4 DU 


gecgeccgca 


ug ucegcgae 


gg ue uceaga. 


udt't't-ydtgg 


Sit^csirxsiSir''Si/'^ 
dy tdy dd t dy 


gCdydCCdCy 


0 on 
Z 3 z u 


gegucgaaac 


f*V^ ^ ^^^^^ ^ s n 

gcecgcucag 


g^ C ^ d u. ^ 


U^wdt^tt'tt 


tyyyutytdt 


t^fit^^ A ^ 
cyytdtt-wurt 


0 c: Rn 


ggcagccgcc 


gi. ucggeeag 




tL-gtt-t^dt'd 


y 1 1 U t> dy y 1 1 


^4- ^/^■^ /^4* 

cttcgtytgtj 


Z OftVJ 


gegaaeagce 


cgcggaaggc 


ctccagatgg 


gcgccggt gc 


cgcaggegae 


gtc gag cage 


z / uu 


gaacgcgcce 


cgggcegacg 


ggaec uga uc 


u ccgcggtgd 


cccgu t cggc 


ct egt ecgce 


•5 •? #;n 
Z / oU 


cagctctttc 


ceeggctgeg 


gt agaccat c 


t cgtacacgt 


ccgccagtt e 


eeggcegt ac 


o Q o n 
ZoZU 


aegcgtcagt 


cct cgtccac 


cagggegacc 


gcccgggt cc 


acccggcgcc 


ggcgccggcg 


o Q on 


accttgaccg ggaagcagca gacgcggaac ccgaaggaga ccggcaggcg gtcgaggttc    2940
gccagccgct cgatctggca gtactcccgc tcccggccca ccacgtgcgc gggccacagc    3000
accgatcggt cgccggtcgc gcggtaccgg tcgatgatgt ggccgaaggg cgcgtccagg    3060
ctgaaggcat cggtcccgat cacccggacc ccgtggtcga gaagcatccg taccgcgggc    3120
ccgtcgagac cggcgaagtc cgtgaagtag cgcggggtgc ccgcgtgccg ctgggcaccg    3180
gcacgatgtc


cccgggccgc 


aacgcgcacc 


cggtccgggc 


cagttccttc 


3240 




cggcgctcac 


ggtgcccgtc 


ggagcgtcgg 


tgaggtccag 


caccaccccg 


3300 




accactccag 


cggcat ctgg 


tcgatgtggc 


gggggacgcc 


gtccccgtac 


3360 




aacca'tagtig 


cgacggcgcg 


tcgacgtgcg 


tgccggtgtg 


cgtggtcagc 


3420 


j'n 1 « a +* ^ ft ¥" 


f^ f a ff^ ft o ^ ^ 

cc>a.gugaCa.g 


ft ^ ^ ft ft f^ f ft 
gciacucgccg 


t ccggcagtt 


cgtccggaga 


gaactcgaca 


3480 


C C Q ol AQ^'b Ct 


cgcgca'bc'bc 


cgcgcacatig 


tgttccgcgc 


cctgccgggg 


cgtgaggacg 


3540 




cc ggglicggg 


c c g't a ct g't 


gaggaabcca 


ccggtgacga 


aaggtcgatg 


3600 


Q ^ ^* 


gcaccbccgg 


g'tticgbagac 


gggctcggct 


gacgcagcgc 


gggtacgacg 


3660 






^ ft ft ^ ^ ^ ft ft a ^ 

L.ggo L> uggao. 


gctggtt cga 


cgggcgggca 


ccgcacgcga 


3720 


^^ft ^9 ft ft f* ft 


cgcaccggcg 


ccgticccggc 


cgagcgggaa 


tccagggagg 


gtatagcggc 


3780 




c ugccguca u 


gguga^gaaa 


ctigacggaca 


gcgagctggg 


gcgtgcgctg 


3840 




guggutacca 


g'bggc'tccgc 


ggcatccacc 


acgat cccta 


cgccctgctg 


3900 


ft f*ft ^ ft f* f^fr 


agagcgacga 


'bccggcgcag 


ctcggccggc 


tgctgcgtga 


acgcggccgg 


3960 


c c d c w a. 


gcgacaccgg 


caccuggguc 


accgcggacc 


atgcgacggc 


ctcccggctg 


4020 


^ frf* ft ft ^ ^ 
L« L. c ^ t; W ^ a. W C 


cgcgcttcgt 


gctgcgccgc 


ccgccggccg 


ggcccgccac 


cggcaccggg 


4080 


^ ci c Cf c a.'b g c 


cgtigggaaga 


ggccacgctg 


agcgacctgc 


tgcccctcga 


cgaggcgcgc 


4140 




accgggcacg 


g'tgccgccgg 


ctcggcgcga 


ccgccgcgcg 


gatcgcggcg 


4200 




bcgcgacgcg 


act cgcggac 


ctggccgggg 


cccgagccga 


acaggtgcgc 


4260 


^ S S ^ ft ft ft 


o ^ ^ ^* :a ^ 

ac u ucgacc u 


cagggccgac 


t acgccctcc 


cgtacgcggt 


cgagccggcc 


4320 




bcggcctgcc 


ggccgggcag 


Tigttccctct 


tcggcgcctt 


ctccccggcc 


4380 






cgtaccgc cc 


cgccttccgg 


aggcgcgcgc 


cctgatcgcc 


4440 






cct cbggccg 


cggctggccc 


cgagcctgtc 


gaagaccgtc 


4500 




aagcgccgga 


c ct c tLt c cb g 


ctgacggccg 


tgttactcgt 


accggccgtc 


4560 


^1 4* f^ ^ a ^^^'^ ftf* 


I-Cugcgaggc 


ggtcgccgcc 


ctgtcgcacg 


accccgggca 


ggccgggctg 


4620 




acccgg^acu 


cgccgcaccg 


gcggtcgagg 


agacgctgcg 


ccacgcaccg 


4680 




uy uuCaCCC u 


ccacgcgacc 


ggaccggagc 


gcgtcgcgga 


cgtcgacctc 


4740 


^ f*f%f^fwftf% f* 


ccgaggbcgc 


cgtcgtcgtg 


gcggcggcgc 


accgcgatcc 


ctcctggtgc 


4800 




s f*ft^'^ ^ft ^ 
accgcu ucgo. 


ccb caccagg 


aacgagcggc 


atctggcact 


gccgccggat 


4860 




ft ft ft ft ^ftr^ 

gggcrgcT-cgc; 


^> ft f^^" ft f^ ^ ft 
cccgc ugc ug 


cgcgtctgcg 


cgaccgcggc 


cgtcgcggcc 


4920 


ctcgcggccg gactcctccc gctgcgggcc gtcggcccgc ccgtacgacg gctgcgtgcc    4980
ccggtcaccc ggtccgtgct gcgcttcccc gtcgccccgt gctgagcagc ccctcctcac    5040
gtcatccccg gcccgccttc ccccgcccgc aacggaaggg actctccatg gacaaccgcg    5100
agaccgtacg accggtgagc gtctgccggg tctgcggcgg caacgactgg caggacgtcg    5160
tggacttcgg tgacgttccc ctcgccaacg gcttcctgtc cccggccgac tcctacgaga    5220
acgagcgccg


ctacccgctg 


ggcgtcctgt 


cct gccgcgc 


ctgccggctg 


atgagcckga 




cccacgtggt 


cgaccccgag 


gtgctgtacc 


gcgactacgc 


ctacaccacc 


cccgactccg 




aaatgatcac 


ccagcacatg 


cggcacatca 


ccgcgctgtg 


ccgcacccgt 


ttcgagcttc 


5400 


ccccggacag 


cctcgtcgtg 


gagctgggca 


gcaataccgg 


ccgtcagctc 


atggccttcc 


5460 


gcgaagcggg 


gatgcgcacc 


ctgggcgtgg 


accccgcgcg 


caacctcacg 


gacgtcgccc 


5520 


ggcgcaacgg 


caticgagacc 


ttccccgact 


t cttctccca 


cgacgtggcc 


cgcaccatcc 


bboU 


ggcgcgacca 


cgggcaggcg 


cggctcgtgc 


t:gggacggca 


tgtcttcgcc 


cacatcgacg 


5640 


acgtgtcgga 


catcgcggcc 


ggcgtacgcg 


aactcctgtc 


tcccgacggg 


gtgttcgcga 


5700 


tcgaggtgcc 


gtacgttctg 


gacctgctgg 


agaaggticgc 


gttcgacacc 


atctaccacg 


5760 


agcacttgtc 


gtacttcacc 


atgcggt cot 


tcgtcaccct 


cttcgcgcgc 


cacgggctgc 


5820 


gggtgctcga 


cgtggagcgg 


ttcggcgtgc 


acggcggatc 


ggtcctcgtc 


ttcgtgggcc 


5880 


acgaggacgg 


cccctggccc 


gaacgtccct 


ccgtccccga 


actgctgcgc 


gtggaacggc 


5940 


agcggggcct 


ct:acgacgac 


gccacct acc 


gcacgti'bcgc 


gcagcggatc 


gagcgggtgc 


bOOO 


gcaccgaact 


gccgga-actg 


ctgcgctccc 


tcgtggccca 


gggcaagcgc 


atcgtcggct 


6060 


acggt:gct:cc 


ggccaagggc 


aacacca'tcc 


'bcacggbgt^g 


cgggctcggc 


cbgaaggagc 


6120 


't g ga at: a cb g 


caccgacacc 


accgagcbga 


agcagggcag 


ggtgctgccc 


ggcacccaca 


6180 


'taccgg'tgca 


cgct cc eg ag 


cacgccaagg 


aacaca'tccc 


eg act act a c 


ctgttgctcg 




cctggaacta 


cgccacggag 


aticcbcgaca 


aggagacggc 


cttccgggac 


aacggcggcc 


6300 


ggt:t: c ail cgt: 


gccca't cccc 


cgcccgt: cga 


t: c c c a c g ti c 


cccgtcaggt 


tcctgaggcg 




cccgccgggc 


agcagcbgac 


gca'bcgccti c 


gcgcagggct 


gcacgccagt 


cgcggggcgg 


b420 


'tgcgacgccg 


accagccgcc 


agcggt cgtg 


cccgagcacc 


gtgcacgccg 


gccggggcgc 


b4oO 


cgggcccggc 


cggtcggccg 


t cgccaccgg 


gcgcacccg't. 


tccgggtccg 


cgcccgccag 


0040 


ccggaacacc 


tcccgggcca 


gctcgtacca 


ggtggccgcc 


ccggcgttgg- tggcgtggaa 


6600 


gaticccgcgc 


gcccgg'tctig 


gcggcgtgcg 


ggccagcgtc 


accagcagcc 


gggccacgtc 


6660 


accggcccac 


gt:cggcbgcc 


cccacLggtic 


g't'tgacgacg 


tcgacatggc 


cgtcgtccgg 


6720 


ggcacgct cc 


agca'bcg'bgc 


gcacgaagct: 


gcggccctgc 


ccgccgtaga 


gccacgccgt 


6780 


gcgcaccacg 


gtgcccgtat 


ccggcagcag 


cgacagcacg 


gcccgttccc 


cggccagttt 


c o A n 


gcbgcggccg 


tiacaccg'kgc 


gcgggcccgg 


agcgticcgac 


tcgccgtaag 


ggctgcgggt 


o900 


g'bcgcccggg 


aagaccftagt: 


cggt:cgagac 


gt:ggat:cagc 


cgtacgccgt 


ggcgcgcaca 


6960 


gcggcgggcc 


agcagccggg 


gcccgccgcc 


gttgacgcgc 


atcgcctccg 


cccaccgcga 


7020 


ctcggcgccg 


tccacgtccg 


tgaaggcggc 


gcagttgacc 


accacccgcg 


gccggtgcgc 


7080 


ggcgaacgcg 


gcgtccaccg 


cccgcccgtc 


ggtgatgtcc 


agcgcgcgcc 


gcccgagtac 


7140 


caccgcctcg 


gcggcgggcc 


ggctcctgcc 


ggtctccgcc 


agggccgcgg 


tcaggtgccg 


7200 


ggcgagcatg 


ccttctcctc 


cgg'tgaccag 


cacgcgcatc 


ccgctcaccg 


gaccccgggg 


7260 
a.c Cf 3. c cf gti QQ


ciCgL.dwL.gL^L' 


o rm c Q c c o t fi 

L^yyvywwy vy 


actccccgct 


tgagcggctc 


ccaccaggac 


7320 


Cgg u. t- c 1. 1- y 


n rT+~ a r'O a "tri 
gy u>dw^dt_> I— y 


oaccatcaaa 

CL w V \A W ^ 


cgcagccccg 


aggagaactc 


ccQcaccciaa 


7380 


/-« /->. /-w 4- "a /-y /~> f 


y \^ W W Vaf ^ W U. ^ 


ggccctgccc 


cagtccaggc 


tgtaacgcag 


gtcgtgcccc 


7440 




acaccitacca 

W W %4 w 


gacgctgctc 


cagtccgccc 


cgcacagctc 


cagcaacatia 


7500 




c c c crot t acra 


gagct cccgg 


ccgccgccga 


tgtggtacac 


accQccQQcrc 


7560 




^ \^ W 


gtccacgccc 


CQQcacrtiQQt: 


cctccacgtg 


cagccactcc 


7620 






^ Cft v Vi* ^ W 


accggcagcc 


cgt:ccaacaa 




7680 


--\ s M /if a 
aay Cy y a. 


i.gdg^ L. 


rrrnotoct era 

wy y y \^ \^^^ bl 


cgcgggccgt 


agti tz. g'b gg a 


acagcgggtiiC 


7740 




1^ ^ dy y w ^ y ^y 


t-y i-y *-yy ^yy 


caggcgaacg 


ccatcaggtc 




7800 


^ uggagg egg 


L^gu-d^ggggd 


n "t" 1" o a cr Q c t c 

y L« i« y y y y w w Vi^ 


agcgggtgcti 


cct ccggcca 


ggaaccggac 


7860 


gt^gd^gg ^g ^ 


i^y ^dydwk>> <— 


crtcccrtoaac 

^ w \M \^ %H W 


accaggacga 


agggCt CCaC 




7920 


dgt^goggog l> 


^ w sy w cty w y 


ctQcrataccQ 

W W 


acgacgttgg 


ticagcacgaa 




7980 


f^r*r^ ferret a ^ rro 
gcgcggd^gg 


a r*r^rm+* f^oa r? 
w y y w w u \_i> 


g'tgcgac'tcc 


y "^y ^^y »- 


aaacaacctQ 




8040 


nr^oa+" a f*T i~ 
gu^Lrfdi^i^ngw i— 


cert ccraccaci 

^ n 


ctcggcgt eg 




cccgcacgaa 


acacaaccacr 


8100 




cccrcCTt ccao 

^ 1^ ^ v,* ^1 




ttgcccgcgt 


acgtcagttt 


Qt-CQacraacQ 


8160 


gL.gd^gwg i-d 


larwy^^yyyy^ 


ccccgctccg 




t:ct:ccagcag 


ca'bgcgcaca 


8220 


x^dgg^^gdgi^ 


orf a "f* rra a a o c 

W y CL CX CL w ^ 


craccacacco 

\A \^ %j 


alioaccacicia 

y wy aw v.,a.yy a. 


tgttcacgtc 




8280 


g dg g ^ g l> g ^ g 


accrccatcrcia 
y w w u •»^9 y y 


'b't c c c c c a't 




ccataaaaca 


aaatQccfccc 


8340 


1^ U» w ^ CL 


a c cr t c a cf cr CI cr 

gu-g i.u.yyy yy 


c era c c n't QQ a 




aacttccrcjcQ 


CaQCQQCQCIC 


8400 


i^t^gci^wggwg 


guggt^uyg«.y 


t ccicccraact 

%^ ' w V-* %■» 


ccaacgg'k'tc 


ticgacggaac 


gaccaacggc 


8460 




ctacccaaac 

Kt* %J W 


agcccggccg 


acraQaQQQaQ 


gacccgttga 


gccgticagac 


8520 


acraaatccrt. c 
ag ciy q • y Utinr 


caccacsB.'tLCict 

\^ w ^ 


tgagcgcctt 


caacaccggc 


aaaaccQacQ 

u.^ ^ ^^^^ 


acgt:ggacga 


8580 


fT'^'ar^a't" r'f ar* 
g tvdwdiv^^wa^ 


^ Cft w V— ' 


'tcaa'tccggc 


caccttiggaa 


cacggcaticc 


acaccgggcc 


8640 


o d dg y ^ y v~- 


nr" f^oa rrot' on 

y ^ w ny w 


c C7 cr c t a a □'t 


gcgggcgacg 


'b^c'tccgagg 


aagcccgccb 


8700 


ncr a aa a cf cr ti □ 


cgga'bcgagg 


agcgcggccc 




gcctacctcg 


tgctctacgg 


8760 


cccrccacatc 


ggccggc't'bg 


t:cgg'ta'tgcc 


gcccaccgac 


cggcgcttct 


ccgg1:gaaca 


8820 


QCf t CI c a c c t a 


atigcgca't eg 


"bcgacgggaa 


ga^ccgcgac 


caccgggact 


ggcccgactt 


8880 


a rrnnn a rr 

w ^ ^y ^3 y 


c^occtcc acre 


'tcggcgaccc 


y^yy^^^y"-^ 




oacatcccT'tQ 

%js^ ^^^^ 


8940 


a o r~"t~ Ci 
a v« w y ^ \^ ^ w y 


aaacccrcacc 


cgacgagaca 


"bcagaccagg 


aaaaatacfct 
""y y »-vjy w ^x 


catgccggat 


9000 


cccggcggcc cgaccacggc cgagaacctg tcgaaggagg ctgtccgctt ctaccgcgag    9060
cagggttacg tgcacatccc gcgcgtcctg tcggagacgg aggtgaccgc cttccgggcc    9120
gcctgtgagg aggtcctgga gaaggagggc cgcgagatct ccggcatcgc cctgcggctg    9180
gccggcgcgc ccctgcgggt ctacagcagc gacatcctgg tcaaggagcc caagcgcacc    9240
ctgcccaccc tggtccacga cgacgagacg ggactgccgc tgaacgagct gagtgccacg    9300
ctgacggcct ggatcgcgct gacggacgta cccgtcgaac gcggctgcat gagctacgtg    9360
ccgggctccc atctcagggc ccgcgaggac cggcaggagc acatgaccag cttcgccgag    9420
ttccgggacc tcgcggacgt gtggcccgat tacccgtggc agccgcgcgt cgccgtgccc    9480
gtccgcgccg gagacgtcgt gttccaccat tgccgtaccg tccacatggc cgaagccaac    9540
accagcgact cggtccgcat ggcgcatggc gtcgtctaca tggacgcgga cgccacctac    9600
cggccgggcg tccaggacgg ccacctgtcc cgcctgtcgc cgggagatcc actcgaaggc    9660
gagctgttcc ccctggtcac ggcaggcaca cggcagtgag gtccgccgtt cccggcggtc    9720
gcgggaccgc cggggacggc accgtcagcc ggccagcgcc acgagcttgg cggccgtctc    9780
ggccggcggc ggcatctcgc tcatctcctg ccgcacccgc agggccgcct cccgcaaccc    9840
cgcgtcgtcc agcagccgtc ggcactgctc ggcacccagc gatcccgcct cggcatcgaa    9900
cccgatgccc agcccggtca gcacatcgcg gttggtgtcc tggtaggagc cgtgcgggat    9960
gacgcactgc gggacgccgg cggccagggc cgtcagcagt gtgccgctgc ccccgtgatg    10020
gatgatcgcg tcgcacgtct ccagcagcgc gcccagcgga atccactcca ccaccggtac    10080
gttcgcgggc agttcaccga gcagggccag gtcgccgccg cccagggtca gcacgaactc    10140
cgcgtccacg tccgccactt cggagaacag cggggccagc ttggcgatgc cgcccgacag    10200
cgcgtcgatg gagcccagcg tcaccgcgat acgccgccgg ccggccgcgg gcggcagcca    10260
gtccggcagc accgctccgc cgttgtaggg gacgtaccgc atcggccagg cacccgggga    10320
gcgccggtcc tccggcagca gcgcctccac gctcggcggt gtcgtcgtca gccgcacgga    10380
accggtcggc tcgccggtga cgccgtggcg ctcgtagtcc ttggacatcg cccgccggat    10440
gagcgcgccg agccccggct cgctgtccgc gggacccagc ggcagctcta cgcacggcag    10500
ttgcagcgct gccgccgtca gcgggaccgc gccctgtgtc ggagtgtgca cgacgaggtc    10560
gggccgccag gtccgcgccg tccgcagcgc cccgtcgacg gccaccgccg atacccgggc    10620
gaacatctcg gcgaagaagc cctcgcccag cccctcggag tgcatcgggt cggtgacgtc    10680
ggtgtcgtcg ggcacgaaca gcttcgcgta gttcacgccg ggcgacacgt ccacggcgca    10740
cagcccggcc tccgcgacgg cgcggatgtc gccccccgtg gcgtagcgga cctcgtggcc    10800
gagagcgcgc agcgcctgtg ccagcggcac cgtcggcagg atgtggctga gcccgggtga    10860
agtgatgaac aacgcacgca tgatgccccc tgttcgacat gaacctggaa cacgcatcct    10920
gacggcgcct tctgttgctc cggtcgacgc ccggtcgaca ggccctcgta cagcccgccg    10980
ggggccggtc cggccacgac gcaggctcca gcggacgtcg acggcgggga cgcagcgtgg    11040
tcgccgggag gcatcgatga cagtattggt aaccggagcc acaggaaacg tcggccggca    11100
cgtcgtcacc gggctactgg ccgccggccg ccgggtgcgg gcgctgaccc gcacacccga    11160
ccggtccggc ctgcccggcg gcgcggagat cacaggcggc gacctgaccc gcccggagac    11220
ctacgagcgg atgctggacg gtgtcgaagc cgtctacctg ttccccgtcc cggagaccgc    11280
cgcggcgttc gccggggccg cgcgacgggc cggtgtccgg cggatcgtgg tgctctcctc    11340



WO 00/24775



PCT/FI99/00870 



ggactccgtc accgacggca ccgacaccgg aggacaccgg cgcgtggaac tggccgtgga    11400
ggacacgggg ctcgagtgga cccatgtgcg ccccggcgag ttcgcgctca acaaggtcac    11460
cctgtgggcg ccgtcgatcc gcgcggaggg cgtcgtccgg tccgcgtatc cggacgcccg    11520
ggtggccccg gtgcacgagg ccgacgtcgc ggccgtcgcg gtgaccgcgc tgctgaagga    11580
ggggcacgcc ggccgcgcct acagcgtgac cggaccgcag gccctcaccc agcgcgaaca    11640
ggtccgcgcg gtaggggagg ggctcggccg gtccctcgcc ttcgtcgagg tgacccccgg    11700
gcaggcgcgg gccgacctga ccgcccaggg gctgcccgcg cccatcgccg actacgtcct    11760
cgccttccaa gccgggtgga ccgagcggcc cgcccccgcc cggccgaccg tgcgggaggt    11820
caccggccgg cccgcccgca cgctcgccca gtgggccgcc gaccaccgag cggacttccg    11880
gtgaccggag accgcgtcca ccgcgccacg acagaaaggc gacgcccgtg cgcgtactgc    11940
tgacgtcctt cgccatggac gcccacttct gcaccgccgt gccgctggcg tgggcactgc    12000
ggtcggccgg gcacgaggta cgggtggccg gccagcccgc gctcacctcc accatcacgg    12060
gagccggcct gaccgccgtg ccggtcggcc gcgaccacac gcacggcagc ctcctgggcc    12120
gggtcggcag cgacatcctc gccctgcacg acgaggcgga ctacctggag gcccgtcacg    12180
acgccctggg cttcgagttc ctcaaagggc acaacacggt gatgtccgcg ttgttctact    12240
cgcagatcaa caacgactcg atggtcgacg acctggtgga cttcgcccgt cactggcggc    12300
ccgacctggt cgtctgggag ccgttcacct tcgcgggcgc cgtggccgcg cgggcctcgg    12360
gcgccgccca cgcccgcctg ctgtccttcc ccgacctgtt cctcagcacg cgccgcctct    12420
tcctggagcg catggcgcgc caggagcccg agcatcacga cgacacactc gccgaatggc    12480
tcgactggac ccttggccgg cacggccact ccttcgacga ggagatcgtc acggggcagt    12540
ggtccatcga ccagaccccc gcccccgtgc ggctcgacgc cggcggtccc accgtgccga    12600
tgcggtacgt cccctacagc ggactggtgc ccacagtggt gcccgactgg ctgcgcaggc    12660
cgcccgagcg gccacgggtc ctggtcaccc tcggcatcac ctcacggcgg gtgaagtcct    12720
tcctcgccgt ctccgtggac gaccttttcg aggccgtggc cgggctcggc gtcgaggtgg    12780
tcgccaccct cgacgccgac cagcgggagc tgctggggcg cgtgccggac cacttccgca    12840
tcgtcgagca cgtgccgctg gacgccgttc tgccgacctg ctcggcgatc gtccaccacg    12900
gcggagccgg cacctggtcg acggccgccg tgtacggggt gccgcaggtc tccctgggct    12960
cgatgtggga ccacttctac cgggcccgtc gcctggagga actcggggcg gggctgcggc    13020
tgccctccgg cgagctgact gccgaggggc tgcgcacccg gctggagagg gtgctcggcg    13080
agccctcctt cggcaccgcc gcgcaggcgc tgagcgacac catcgcggcg gaacccagcc    13140
ccagcgaggt cgtgccggtc ctggaggagc tgaccggacg gcaccgtccc ggcacccggg    13200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn    13260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ccgtccgggc ccctcgccgg    13320
tgagggagcc cggatcacag tccgtccggc accacgccca ggtcccggaa cagcggggag    13380
aagttgaaga


cgtcccagtg 


ct ccacgacc 




cggdgddgwy 


a /-T +- r^\- r* 
Odytv L. L_ 


13440 


aagtaggtcc 


agcggacctt 


gcggccggtg 


ggggcgatgc 


CJCduyddL^dL^ 


yooL^ u.yy wy^^ 


13500 


gtggccgagc 


aggtgatccg 


cagcaucacg 






(~t /— • 4- /~» /-« (~» fti 
y u> L.v_r^yL>d^y 


13560 


tccagacgaa 


ggtccgggaa 


ggcctccacc 


*^ "V" / « !■ « 3 

gcgcuguuca 


x-d^y^^g i.du< 


/-^ a f"' 4- O ^ /~T 

y d o o i^w^ ^u>y 


13620 


gcgctcaccg 


gtutgtcctc 


g ucgucg 




wyyy i^y^L^^ci 


ni~ rti^rtr^n^ c-r* 
y w y w y ^ y ^ 


13680 


accccggaga 


cgt cccaccg 


guucca^gcg 




^^dyyt^dyi^y 




13740 


gcggtcgttg 


acatgtcgac 


ccc^ ugaagg 


w c c g gg a u d 


L> ^yy ^^d^yu« 


y l_ cLy O l_« L« L> 


13800 


caacccgccc 


cggaaaagcg 


gtgcacgacc 


gctggagccc 


gcaccggaac 


cugcgcggcg 




gagctgaacg 


gggtttcgag 


ccgnucacca 


dy y ctu> cy ^ 


yL>dyc>w 1— y ^ ^ 


d^ y y w> d CL ^ v— 


13920 


ctgacgcctc 


gctccgcgcg 


ggacgcgccc 


gccgggagga 


aggacacdcc 


d cj c d T. g u gg 


J. J 7 OU 


tacgcaccga 


tcagacggcg 


gcaccggaag 


accgagcggc 


ggccacggat 


cccy gg^ ucg 




ggcacctgta 


cgcgcaggtg 


cagcagttct 


acgcccggca 


gatgcagctc 


ctcgactccg 


1 A 1 nn 

i** J. w\J 


gcgcggccga 


ggagtgggcc 


gccaccttca 


ccgaggacgg 


Cdcgx.^cgcc 


^ n ^ ^ ^ 

L^ggou>wi^d^u- 


^ *t X V w 


cgccggaacc 


ggcacgcggc 


cacgccgaac 


tggccgccgg 


cgcccgcgcc 


y o ^y c^y ddu- 


X *• ^ ^ u 


gcctcgccgc 


cgagggcctt 


tcgcaccggc 




Cd L.gdccgcg 


"4— ^ r"* ft 

yi^du>yu.L<'yyy 




aacccgacgg 


cagcgtgttc 


gtacgcagct 


acgcccaggu 


^ +- ^f^^ ^ a 

cx-ucgcwdcc 


u.yu't^yv'yyyy 


i *» J w 


aagctccccg 


gctgcatctg 


atctgcgt ct 


gcgaggacgi:. 


g c u w g u. g y y 


y d y y y y L« y y 


X •* *T w w 


ggctgaaggt 


gcgggaacgg 


gttgtcacgc 


acga-cgcgug 


dy y y oy y i.L^y 


dt_>OL-yL-.L-.yyi_ 


14460 


cgagccgcac 


ct ctgccacc 


ccc^cggcac 


y C dy w y y ^ 


y L«y dy wy 


^ I. y y dy dy y 


14520 


gcgcacttag 


cgtgcgagcc 


/i-w ^ j*^ ^ -4- 

a ugacugac u 


g Ct o d y y L. ^ 


o^yo^i.^yyi^y 


^ ^ ^y w ^ ci. i_- y w 


14580 


cacccgcccc 


cagccccacg 


+^ /~* /-^ /-^ 
ccuucccccg 




^yyydyL^ydd 


\^ ^4 w *j ^ w y W 


14640 


tcgccgtgat. 


cgticaccggc 


ggcggttcgg 


guducggccg 


ygc>c>d(^c-y(.'^ 


L^yu^yoL^ i-^i^y 


X *x / \J\J 


ccgcbcaggg 


tgcgaaggtg 


Cucgucy X- eg 


n r* r^e^^ a /~» rr ii 
yOOyWdL'L'gd 


yydwy^yurf l.^ 


y wy ^ d^ w y 


14760 


ccgagggctg 


tgcggacatg 


cgt gt get eg 


^cgccgacgu 


ggc c^cgccc 


Si t~t fy (T z^* 
yd^yyy^u>y^ 


X *V w 


aggcggt cgt 


caacgccgcc 


ctgcgggagt 


ucgggaggau 


cgdcg uccug 


y l-LJddC_ddOy 


14880 


ctgccgtggc 


gggcatggag 


accctgcaga 


ccgt cgaccg 


ggacgccgi,g 


gcdcggcdgu 


1 AQACi 


t:cggcaccaa 


tctgacggct 


CCCCtCu uCC 


ucguccag^c 


a /~T /~T 

cgcdc ucggc 


y u> y c L.y y dy d 




agtcgcgcgg 


c at cgt cgt c 


aacgtgggga 


ccgccgcgdc 


CL.i.yyyu.^L.y 


^yoyi^L^y^^^ 


15060 


cgaccggcgc 


gctgtacggg 


gcgagcaagg 


uggccc^cga 


wudi.'^uyd^k.' 


L-yyd^L^ i>yyy 


1 c 1 on 

X ^ X ^ u 


ccgtcgaact ggccccccgg ggcatccgtg tcgtcggcgt ggcacccggg gtgatcgaca    15180
cgggcatcgg cgtccgcatg ggcatgaccc cggagggcta ccgggagttc ctgaccggca    15240
tgggcggcag ggtgcccgtg ggccgggtcg gccgtccgga ggacgtggcc tggtggatcg    15300
tccagctcgc ccgcccggag gccggctacg cgacgggcat ggtcgtcccc gtcgacggcg    15360
ggctgtcgct ggtctgaccg gacaaggaag gaaataccgc aggaaggaag taccgcagca    15420
aggaaatacc gcaggaagga gatatcgccg tgcaggaaac cgaacccggc gtccccgcgg    15480
acctgcccgc cgagagcgac cctgccgccc tggagcgcct cgccgcacgg taccggcggg    15540
acggctacgt ccacgtcccc ggcgtcctcg acgccgggga ggtcgccgaa tacctggccg    15600
aggcccgtcg gctcctcgcc cacgaggagt ccgtgcgctg gggctccggc gccggcaccg    15660
tcatggacta cgtcgccgac gcccagctcg gcagcgacac gatgcgccgc cttgccaccc    15720
acccgcgcat cgccgccctc gccgagtacc tggccggctc gcccctgagg ctgttcaagc    15780
tggaggtgct gctcaaggag aacaaggaga aggacgcctc ggtccccacc gccccgcacc    15840
acgatgcgtt cgccttcccg ttctccaccg ccggcaccgc cctgacggcg tgggtcgcgc    15900
tggtcgacgt cccggtggaa cgcggctgca tgaccttcgt ccccggatca cacctgctgc    15960
cggatcccga taccggcgac gagccgtggg ccggggcctt cacccggccg ggagagatct    16020



<210> 2
<211> 342
<212> PRT
<213> Streptomyces nogalater ATCC 27451

<220>
<223> "translate of snogI, function: aminotransferase"

<400> 2

Met Thr Val His Val Trp Asp Tyr Leu Pro Glu Tyr Glu Leu Glu Arg
1 5 10 15

Glu Asp Ile His Asp Ala Val Glu Thr Val Phe Arg Ser Gly Arg Leu
20 25 30

Val Leu Gly Glu Ser Val Arg Gly Phe Glu Ser Glu Phe Ala Ser Phe
35 40 45

Gln Gly Val Gly His Ala Val Gly Val Asp Asn Gly Thr Asn Ala Val
50 55 60

Lys Leu Gly Leu Gln Ala Leu Gly Val Gly Pro Gly Asp Glu Val Val
65 70 75 80

Thr Val Ser Asn Thr Ala Ala Pro Thr Val Val Ala Ile Asp Ser Ala
85 90 95

Gly Ala Thr Pro Val Phe Val Asp Val Arg Glu Glu Asp Tyr Leu Met
100 105 110

Asp Thr Ser Gln Val Glu Ala Val Leu Thr Pro Arg Thr Arg Cys Leu
115 120 125

Leu Pro Val His Leu Tyr Gly Gln Cys Val Asp Met Ala Pro Leu Arg
130 135 140

Asp Leu Ala Ala Arg His Asn Leu Val Ile Leu Glu Asp Cys Ala Gln
145 150 155 160

Ala His Gly Ala Arg Arg His Gly Arg Leu Ala Gly Ser Thr Gly Asp
165 170 175

Ala Ala Ala Phe Ser Phe Tyr Pro Thr Lys Val Leu Gly Ala Tyr Gly
180 185 190

Asp Gly Gly Ala Val Leu Thr Asp Asp Glu Arg Val Ala Asp Arg Leu
195 200 205

Arg Arg Leu Arg Tyr Tyr Gly Met Glu Ser Arg Tyr Tyr Val Val Glu
210 215 220

Thr Pro Gly His Asn Ser Arg Leu Asp Glu Val Gln Ala Glu Ile Leu
225 230 235 240

Arg Arg Lys Leu Ser Arg Leu Pro Ser Tyr Ile Glu Ala Arg Arg Ala
245 250 255

Val Ala Arg Arg Tyr Glu Glu Gly Leu Ala Asp Thr Gly Leu Leu Leu
260 265 270

Pro Arg Thr Ala Gln Gly Asn Glu His Val Tyr Tyr Val Tyr Val Val
275 280 285

Arg His Pro Arg Arg Asp Ala Val Leu Glu Ala Leu Arg Ala Ser Tyr
290 295 300

Asp Ile Ala Leu Asn Ile Ser Tyr Pro Trp Pro Val His Thr Met Thr
305 310 315 320

Gly Phe Ser His Leu Gly Tyr Ala Lys Gly Ser Leu Pro Val Thr Glu
325 330 335

Ala Leu Ala Asp Glu Ile
340

<210> 3
<211> 293
<212> PRT
<213> Streptomyces nogalater ATCC 27451

<220>
<223> "translate of snogJ, function: dTDP-glucose synthase"

<400> 3

Val Lys Gly Ile Ile Leu Ala Gly Gly Thr Gly Ser Arg Leu His Pro
1 5 10 15

Thr Thr Leu Ala Val Ser Lys Gln Leu Leu Pro Val Gly Asp Lys Pro
20 25 30

Met Ile Tyr Tyr Pro Leu Ser Val Leu Met Leu Ala Gly Val Thr Asp
35 40 45

Ile Leu Ile Ile Ser Thr Pro His Glu Leu Pro Arg Met Arg Arg Leu
50 55 60

Phe Gly Asp Gly Ala Gln Leu Gly Leu Arg Leu Ala Tyr Ala Glu Gln
65 70 75 80

Glu Lys Pro Arg Gly Ile Ala Glu Ala Phe Leu Ile Gly Ala Asp His
85 90 95

Val Gly Ser Asp Ala Val Ala Leu Ala Leu Gly Asp Asn Ile Phe His
100 105 110

Gly Ser Ser Phe Gln Gly Val Leu Arg Lys Glu Ala Glu Glu Leu Asp
115 120 125

Gly Cys Val Leu Phe Gly Tyr Pro Val Lys Asp Pro Gln Arg Tyr Gly
130 135 140

Val Gly Glu Ala Asn Ala Ser Gly Arg Leu Val Ser Ile Glu Glu Lys
145 150 155 160

Pro Val Arg Pro Arg Ser Asn Arg Ala Ile Thr Gly Leu Tyr Phe Tyr
165 170 175

Asp Asn Glu Val Val Asp Ile Ala Arg Arg Leu Arg Pro Ser Ala Arg
180 185 190

Gly Glu Leu Glu Ile Thr Asp Ile Asn Arg Thr Tyr Met Glu Arg Gly
195 200 205

Arg Ala Arg Leu Val Asp Leu Gly Arg Gly Phe Ala Trp Leu Asp Thr
210 215 220

Gly Thr Pro Glu Ser Leu Leu Gln Ala Ser Gln Tyr Val Ser Ala Leu
225 230 235 240

Glu Glu Arg Gln Gly Ile Arg Ile Ala Cys Ile Glu Glu Val Ala Leu
245 250 255

Arg Met Gly Phe Ile Asn Ala Gln Ala Cys Tyr Glu Leu Gly Ala Arg
260 265 270

Leu Ser Gly Ser Gly Tyr Gly Gln Tyr Val Met Ala Ile Ala Glu Glu
275 280 285

Cys Thr Gly Arg Val
290

<210> 4
<211> 238
<212> PRT
<213> Streptomyces nogalater ATCC 27451

<220>
<223> "translate of snogA, function: aminomethyl transferase"

<400> 4

Val Tyr Gly Arg Glu Leu Ala Asp Val Tyr Glu Met Val Tyr Arg Ser
1 5 10 15

Arg Gly Lys Ser Trp Ala Asp Glu Ala Glu Arg Val Thr Ala Glu Ile
20 25 30

Arg Ser Arg Arg Pro Gly Ala Arg Ser Leu Leu Asp Val Ala Cys Gly
35 40 45

Thr Gly Ala His Leu Glu Ala Phe Arg Gly Leu Phe Ala His Thr Glu
50 55 60

Gly Leu Glu Leu Ser Asp Glu Met Arg Ala Leu Ala Glu Arg Arg Leu
65 70 75 80

Pro Gly Val Pro Val Arg Pro Gly Asp Met Arg Asp Phe Ala Leu Ser
85 90 95

Gly Arg Phe Asp Ala Val Val Cys Leu Phe Cys Ser Ile Gly Tyr Leu
100 105 110

Glu Thr Val Ala Asp Met Arg Ala Ala Val Arg Thr Met Ala Ala His
115 120 125

Leu Val Pro Gly Gly Val Leu Val Val Glu Pro Trp Trp Phe Pro Glu
130 135 140

Arg Phe Leu Glu Gly Tyr Val Ala Gly Asp Leu Ala Arg Gly Glu Gly
145 150 155 160

Arg Thr Val Ala Arg Val Ser His Ser Thr Arg Gln Gly Arg Arg Thr
165 170 175

Arg Met Glu Val Arg Phe Leu Val Gly Glu Ala Thr Gly Ile Arg Glu
180 185 190

Phe Thr Glu Ile Asp Leu Leu Thr Leu Phe Thr Arg Glu Glu Tyr Leu
195 200 205

Ala Ala Phe Glu Asp Ala Gly Cys Pro Ala Glu Phe Leu Asp Asp Gly
210 215 220

Leu Thr Gly Arg Gly Leu Phe Val Gly Val Arg Gly Ala Gly
225 230 235

<210> 5 

<211> 324 

<212> PRT 

<213> StreptomycBs nogalater ATCC 27451 
<220> 

<223> "translate of snoaH, function: polyketide cyclase" 



<400> 



Met Thr Ala Ala Trp Gly Ala Pro Leu Tyr Pro Pro Trp He Pro Ala 
15 10 15 

Arg Pro Gly Arg Arg Arg Cys Gly Ala Gly Arg Arg Val Arg Cys Pro 
20 25 30 

Pro Val Glu Pro Ala Ser Arg Pro Arg Gin Glu Gly Arg Val Ser Val 
35 40 45 

Val Pro Ala Leu Arg Gin Pro Ser Pro Ser Thr Asn Pro Glu Val Arg 
50 55 60 

Val Arg Leu He Asp Leu Ser Ser Pro Val Asp Ser Ser Gin Tyr Glu 
65 70 75 80 

Pro Asp Pro Val Val His Asp Val Leu Thr Pro Arg Gin Gly Ala Glu 
85 90 95 

His Met Cys Ala Glu Met Arg Glu His Phe Gly Val Glu Phe Ser Pro 
100 105 110 

Asp Glu Leu Pro Asp Gly Glu Phe Leu Ser Leu Asp Arg He Thr Leu 
115 120 125 

Thr Thr His Thr Gly Thr His Val Asp Ala Pro Ser His Tyr Gly Ser 
130 135 140 

Arg Ala Leu Tyr Gly Asp Gly Val Pro Arg His He Asp Gin Met Pro 
145 150 155 160 

Leu Glu Trp Phe Phe Gly Arg Gly Val Val Leu Asp Leu Thr Asp Ala 
165 170 175 

Pro Thr Gly Thr Val Ser Ala Ala Arg Leu Glu Lys Glu Leu Ala Arg 
180 185 190 

Thr Gly Cys Ala Leu Arg Pro Gly Asp Ile Val Leu Leu His Thr Gly
195 200 205 
Ala Gln Arg His Ala Gly Thr Pro Arg Tyr Phe Thr Asp Phe Ala Gly
210 215 220 

Leu Asp Gly Pro Ala Val Arg Met Leu Leu Asp His Gly Val Arg Val 
225 230 235 240 

Ile Gly Thr Asp Ala Phe Ser Leu Asp Ala Pro Phe Gly His Ile Ile
245 250 255 

Asp Arg Tyr Arg Ala Thr Gly Asp Arg Ser Val Leu Trp Pro Ala His 
260 265 270 

Val Val Gly Arg Glu Arg Glu Tyr Cys Gln Ile Glu Arg Leu Ala Asn
275 280 285 

Leu Asp Arg Leu Pro Val Ser Phe Gly Phe Arg Val Cys Cys Phe Pro 
290 295 300 

Val Lys Val Ala Gly Ala Gly Ala Gly Trp Thr Arg Ala Val Ala Leu 
305 310 315 320 

Val Asp Glu Asp 

<210> 6 

<211> 408 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snogN, function: unknown"



<400> 6



Met Val Met Lys Leu Thr Asp Ser Glu Leu Gly Arg Ala Leu Leu Ser 
1 5 10 15

Leu Arg Gly Tyr Gln Trp Leu Arg Gly Ile His His Asp Pro Tyr Ala
20 25 30 

Leu Leu Leu Arg Ala Glu Ser Asp Asp Pro Ala Gln Leu Gly Arg Leu
35 40 45 

Leu Arg Glu Arg Gly Arg Leu His Arg Ser Asp Thr Gly Thr Trp Val 
50 55 60 

Thr Ala Asp His Ala Thr Ala Ser Arg Leu Leu Ala Asp Pro Arg Phe 
65 70 75 80 

Val Leu Arg Arg Pro Pro Ala Gly Pro Ala Thr Gly Thr Gly Asp Val 
85 90 95 

Met Pro Trp Glu Glu Ala Thr Leu Ser Asp Leu Leu Pro Leu Asp Glu 
100 105 110 

Ala Arg Leu Thr Thr Asp Arg Ala Arg Cys Arg Arg Leu Gly Ala Thr 
115 120 125 

Ala Ala Arg Ile Ala Ala Asp Gly Pro Val Ala Thr Arg Leu Ala Asp
130 135 140 

Leu Ala Gly Ala Arg Ala Glu Gln Val Arg Ser Thr Gly His Phe Asp
145 150 155 160 

Leu Arg Ala Asp Tyr Ala Leu Pro Tyr Ala Val Glu Pro Ala Cys Ala 
165 170 175 
Leu Leu Gly Leu Pro Ala Gly Gln Cys Ser Leu Phe Gly Ala Phe Ser
180 185 190 

Pro Ala Val Leu Leu Asp Ala Thr Val Val Pro Pro Arg Leu Pro Glu 
195 200 205 

Ala Arg Ala Leu Ile Ala Ser Thr Ala Glu Leu Thr Ala Leu Trp Pro
210 215 220 

Arg Leu Ala Pro Ser Leu Ser Lys Thr Val Pro Glu Asp Glu Ala Pro 
225 230 235 240 

Asp Leu Phe Leu Leu Thr Ala Val Leu Leu Val Pro Ala Val Val His 
245 250 255 

Leu Val Cys Glu Ala Val Ala Ala Leu Ser His Asp Pro Gly Gln Ala
260 265 270 

Gly Leu Leu Arg Asp Asp Pro Val Leu Ala Ala Pro Ala Val Glu Glu 
275 280 285 

Thr Leu Arg His Ala Pro Pro Ala Arg Leu Phe Thr Leu His Ala Thr 
290 295 300 

Gly Pro Glu Arg Val Ala Asp Val Asp Leu Pro Ala Gly Ala Glu Val 
305 310 315 320 

Ala Val Val Val Ala Ala Ala His Arg Asp Pro Ser Trp Cys Pro Asp 
325 330 335 

Pro Asp Arg Phe Asp Leu Thr Arg Asn Glu Arg His Leu Ala Leu Pro 
340 345 350 

Pro Asp Leu Pro Leu Gly Ala Leu Ala Pro Leu Leu Arg Val Cys Ala 
355 360 365 

Thr Ala Ala Val Ala Ala Leu Ala Ala Gly Leu Leu Pro Leu Arg Ala 
370 375 380 

Val Gly Pro Pro Val Arg Arg Leu Arg Ala Pro Val Thr Arg Ser Val 
385 390 395 400 

Leu Arg Phe Pro Val Ala Pro Cys 
405 

<210> 7 

<211> 422 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snoaG, function: hydroxylase"



<400> 7



Met Asp Asn Arg Glu Thr Val Arg Pro Val Ser Val Cys Arg Val Cys 
1 5 10 15

Gly Gly Asn Asp Trp Gln Asp Val Val Asp Phe Gly Asp Val Pro Leu
20 25 30 

Ala Asn Gly Phe Leu Ser Pro Ala Asp Ser Tyr Glu Asn Glu Arg Arg 
35 40 45 

Tyr Pro Leu Gly Val Leu Ser Cys Arg Ala Cys Arg Leu Met Ser Leu 
50 55 60 




Thr His Val Val Asp Pro Glu Val Leu Tyr Arg Asp Tyr Ala Tyr Thr 
65 70 75 80 

Thr Pro Asp Ser Glu Met Ile Thr Gln His Met Arg His Ile Thr Ala
85 90 95 

Leu Cys Arg Thr Arg Phe Glu Leu Pro Pro Asp Ser Leu Val Val Glu 
100 105 110 

Leu Gly Ser Asn Thr Gly Arg Gln Leu Met Ala Phe Arg Glu Ala Gly
115 120 125 

Met Arg Thr Leu Gly Val Asp Pro Ala Arg Asn Leu Thr Asp Val Ala 
130 135 140 

Arg Arg Asn Gly Ile Glu Thr Phe Pro Asp Phe Phe Ser His Asp Val
145 150 155 160 

Ala Arg Thr Ile Arg Arg Asp His Gly Gln Ala Arg Leu Val Leu Gly
165 170 175 

Arg His Val Phe Ala His Ile Asp Asp Val Ser Asp Ile Ala Ala Gly
180 185 190 

Val Arg Glu Leu Leu Ser Pro Asp Gly Val Phe Ala Ile Glu Val Pro
195 200 205 

Tyr Val Leu Asp Leu Leu Glu Lys Val Ala Phe Asp Thr Ile Tyr His
210 215 220 

Glu His Leu Ser Tyr Phe Thr Met Arg Ser Phe Val Thr Leu Phe Ala 
225 230 235 240 

Arg His Gly Leu Arg Val Leu Asp Val Glu Arg Phe Gly Val His Gly 
245 250 255 

Gly Ser Val Leu Val Phe Val Gly His Glu Asp Gly Pro Trp Pro Glu 
260 265 270 

Arg Pro Ser Val Pro Glu Leu Leu Arg Val Glu Arg Gln Arg Gly Leu
275 280 285 

Tyr Asp Asp Ala Thr Tyr Arg Thr Phe Ala Gln Arg Ile Glu Arg Val
290 295 300 

Arg Thr Glu Leu Pro Glu Leu Leu Arg Ser Leu Val Ala Gln Gly Lys
305 310 315 320 

Arg Ile Val Gly Tyr Gly Ala Pro Ala Lys Gly Asn Thr Ile Leu Thr
325 330 335 

Val Cys Gly Leu Gly Leu Lys Glu Leu Glu Tyr Cys Thr Asp Thr Thr 
340 345 350 

Glu Leu Lys Gln Gly Arg Val Leu Pro Gly Thr His Ile Pro Val His
355 360 365 

Ala Pro Glu His Ala Lys Glu His Ile Pro Asp Tyr Tyr Leu Leu Leu
370 375 380 

Ala Trp Asn Tyr Ala Thr Glu Ile Leu Asp Lys Glu Thr Ala Phe Arg
385 390 395 400 

Asp Asn Gly Gly Arg Phe Ile Val Pro Ile Pro Arg Pro Ser Ile Leu
405 410 415 
Thr Ser Pro Ser Gly Ser
420 

<210> 8 
<211> 291 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451

<220> 

<223> "translate of snogC, function: dTDP-4-dehydrorhamnose reductase" 

<400> 8 

Met Leu Ala Arg His Leu Thr Ala Ala Leu Ala Glu Thr Gly Arg Ser 
1 5 10 15

Arg Pro Ala Ala Glu Ala Val Val Leu Gly Arg Arg Ala Leu Asp Ile
20 25 30 

Thr Asp Gly Arg Ala Val Asp Ala Ala Phe Ala Ala His Arg Pro Arg 
35 40 45 

Val Val Val Asn Cys Ala Ala Phe Thr Asp Val Asp Gly Ala Glu Ser 
50 55 60 

Arg Trp Ala Glu Ala Met Arg Val Asn Gly Gly Gly Pro Arg Leu Leu 
65 70 75 80 

Ala Arg Arg Cys Ala Arg His Gly Val Arg Leu Ile His Val Ser Thr
85 90 95 

Asp Tyr Val Phe Pro Gly Asp Thr Arg Ser Pro Tyr Gly Glu Ser Asp 
100 105 110 

Ala Pro Gly Pro Arg Thr Val Tyr Gly Arg Ser Lys Leu Ala Gly Glu 
115 120 125 

Arg Ala Val Leu Ser Leu Leu Pro Asp Thr Gly Thr Val Val Arg Thr 
130 135 140 

Ala Trp Leu Tyr Gly Gly Gln Gly Arg Ser Phe Val Arg Thr Met Leu
145 150 155 160 

Glu Arg Ala Pro Asp Asp Gly His Val Asp Val Val Asn Asp Gln Trp
165 170 175 

Gly Gln Pro Thr Trp Ala Gly Asp Val Ala Arg Leu Leu Val Thr Leu
180 185 190 

Ala Arg Thr Pro Pro Asp Arg Ala Arg Gly Ile Phe His Ala Thr Asn
195 200 205 

Ala Gly Ala Ala Thr Trp Tyr Glu Leu Ala Arg Glu Val Phe Arg Leu 
210 215 220 

Ala Gly Ala Asp Pro Glu Arg Val Arg Pro Val Ala Thr Ala Asp Arg 
225 230 235 240 

Pro Gly Pro Ala Pro Arg Pro Ala Cys Thr Val Leu Gly His Asp Arg 
245 250 255 

Trp Arg Leu Val Gly Val Ala Pro Pro Arg Asp Trp Arg Ala Ala Leu
260 265 270 

Arg Glu Ala Met Arg Gln Leu Leu Pro Gly Gly Arg Leu Arg Asn Leu
275 280 285 
Thr Gly Thr
290 

<210> 9 
<211> 350 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snogK, function: dTDP-glucose-4,6-dehydratase"

<400> 9 

Met Ala Ser His Thr Ser Ala Thr Thr Asp Val Asn Ile Leu Val Thr
1 5 10 15

Gly Ala Val Gly Phe Ile Gly Ser Ala Tyr Val Arg Met Leu Leu Glu
20 25 30 

Asn Arg Ala Pro Gly Ala Gly Ala Pro Ala Val Arg Val Thr Val Leu 
35 40 45 

Asp Lys Leu Thr Tyr Ala Gly Asn Leu Thr Asn Leu Asp Ala Val Arg 
50 55 60

Gly Asp Arg Leu Arg Phe Val Arg Gly Asp Ile Leu Asp Ala Glu Leu
65 70 75 80 

Val Asp Glu Leu Met Ala His Ser Asp Gln Val Val His Phe Ala Ala
85 90 95 

Glu Ser His Val Asp Arg Ser Ile Arg Ala Ala Asp Asp Phe Val Leu
100 105 110 

Thr Asn Val Val Gly Thr Gln Arg Leu Leu Asp Ala Ala Leu Arg His
115 120 125 

Gly Val Glu Pro Phe Val Leu Val Ser Thr Asp Glu Val Tyr Gly Ser 
130 135 140 

Ile Ala Ser Gly Ser Trp Pro Glu Glu His Pro Leu Ser Pro Asn Ser
145 150 155 160 

Pro Tyr Ala Ala Ser Lys Ala Ser Ala Asp Leu Met Ala Phe Ala Cys 
165 170 175 

His Arg Thr His Gly Leu Asp Val Arg Val Thr Arg Cys Ser Asn Asn 
180 185 190 

Tyr Gly Pro Arg Gln His Pro Glu Lys Leu Ile Pro Arg Phe Val Thr
195 200 205 

Asn Leu Leu Asp Gly Leu Pro Val Pro Leu Tyr Gly Asp Gly Arg Asn 
210 215 220 

Val Arg Glu Trp Leu His Val Glu Asp His Cys Arg Gly Val Asp Leu 
225 230 235 240 

Val Arg Thr Ala Gly Arg Pro Gly Gly Val Tyr His Ile Gly Gly Gly
245 250 255 

Arg Glu Leu Ser Asn Arg Glu Leu Val Gly Met Leu Leu Glu Leu Cys 
260 265 270 

Gly Ala Asp Trp Ser Ser Val Arg His Val Pro Asp Arg Lys Gly His 
275 280 285 
Asp Leu Arg Tyr Ser Leu Asp Trp Gly Arg Ala Arg Glu Glu Leu Gly
290 295 300 

Tyr Arg Pro Ala Arg Glu Phe Ser Ser Gly Leu Arg Ser Thr Val Gln
305 310 315 320 

Trp Tyr Arg Glu Asn Arg Ser Trp Trp Glu Pro Leu Lys Arg Gly Val 
325 330 335 

Thr Ala Pro Gly Gly Thr Ser Thr Val Val Pro Gly Val Arg 
340 345 350 

<210> 10 
<211> 134 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snoaL, function: NAME cyclase"

<400> 10 

Met Val Ser Ala Phe Asn Thr Gly Arg Thr Asp Asp Val Asp Glu Tyr 
1 5 10 15

Ile His Pro Asp Tyr Leu Asn Pro Ala Thr Leu Glu His Gly Ile His
20 25 30 

Thr Gly Pro Lys Ala Phe Ala Gln Leu Val Gly Trp Val Arg Ala Thr
35 40 45 

Phe Ser Glu Glu Ala Arg Leu Glu Glu Val Arg Ile Glu Glu Arg Gly
50 55 60 

Pro Trp Val Lys Ala Tyr Leu Val Leu Tyr Gly Arg His Val Gly Arg 
65 70 75 80 

Leu Val Gly Met Pro Pro Thr Asp Arg Arg Phe Ser Gly Glu Gln Val
85 90 95 

His Leu Met Arg Ile Val Asp Gly Lys Ile Arg Asp His Arg Asp Trp
100 105 110 

Pro Asp Phe Gln Gly Thr Leu Arg Gln Leu Gly Asp Pro Trp Pro Asp
115 120 125

Asp Glu Gly Trp Arg Pro 
130 

<210> 11 
<211> 235 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snoK, function: unknown" 
<400> 11 

Met Pro Asp Pro Gly Gly Pro Thr Thr Ala Glu Asn Leu Ser Lys Glu 
1 5 10 15

Ala Val Arg Phe Tyr Arg Glu Gln Gly Tyr Val His Ile Pro Arg Val
20 25 30 

Leu Ser Glu Thr Glu Val Thr Ala Phe Arg Ala Ala Cys Glu Glu Val 
35 40 45 
Leu Glu Lys Glu Gly Arg Glu Ile Ser Gly Ile Ala Leu Arg Leu Ala
50 55 60 

Gly Ala Pro Leu Arg Val Tyr Ser Ser Asp Ile Leu Val Lys Glu Pro
65 70 75 80 

Lys Arg Thr Leu Pro Thr Leu Val His Asp Asp Glu Thr Gly Leu Pro 
85 90 95 

Leu Asn Glu Leu Ser Ala Thr Leu Thr Ala Trp Ile Ala Leu Thr Asp
100 105 110 

Val Pro Val Glu Arg Gly Cys Met Ser Tyr Val Pro Gly Ser His Leu 
115 120 125 

Arg Ala Arg Glu Asp Arg Gln Glu His Met Thr Ser Phe Ala Glu Phe
130 135 140 

Arg Asp Leu Ala Asp Val Trp Pro Asp Tyr Pro Trp Gln Pro Arg Val
145 150 155 160 

Ala Val Pro Val Arg Ala Gly Asp Val Val Phe His His Cys Arg Thr 
165 170 175 

Val His Met Ala Glu Ala Asn Thr Ser Asp Ser Val Arg Met Ala His 
180 185 190 

Gly Val Val Tyr Met Asp Ala Asp Ala Thr Tyr Arg Pro Gly Val Gln
195 200 205 

Asp Gly His Leu Ser Arg Leu Ser Pro Gly Asp Pro Leu Glu Gly Glu 
210 215 220 

Leu Phe Pro Leu Val Thr Ala Gly Thr Arg Gln
225 230 235 

<210> 12 
<211> 390 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451
<220> 

<223> "translate of snogD, function: glycosyl transferase"

<400> 12 

Met Arg Val Pro Gly Ser Cys Arg Thr Gly Gly Ile Met Arg Ala Leu
1 5 10 15

Phe Ile Thr Ser Pro Gly Leu Ser His Ile Leu Pro Thr Val Pro Leu
20 25 30 

Ala Gln Ala Leu Arg Ala Leu Gly His Glu Val Arg Tyr Ala Thr Gly
35 40 45 

Gly Asp Ile Arg Ala Val Ala Glu Ala Gly Leu Cys Ala Val Asp Val
50 55 60 

Ser Pro Gly Val Asn Tyr Ala Lys Leu Phe Val Pro Asp Asp Thr Asp 
65 70 75 80 

Val Thr Asp Pro Met His Ser Glu Gly Leu Gly Glu Gly Phe Phe Ala 
85 90 95 

Glu Met Phe Ala Arg Val Ser Ala Val Ala Val Asp Gly Ala Leu Arg 
100 105 110 
Thr Ala Arg Ser Trp Arg Pro Asp Leu Val Val His Thr Pro Thr Gln
115 120 125 

Gly Ala Gly Pro Leu Thr Ala Ala Ala Leu Gln Leu Pro Cys Val Glu
130 135 140 

Leu Pro Leu Gly Pro Ala Asp Ser Glu Pro Gly Leu Gly Ala Leu Ile
145 150 155 160 

Arg Arg Ala Met Ser Lys Asp Tyr Glu Arg His Gly Val Thr Gly Glu 
165 170 175 

Pro Thr Gly Ser Val Arg Leu Thr Thr Thr Pro Pro Ser Val Glu Ala 
180 185 190 

Leu Leu Pro Glu Asp Arg Arg Ser Pro Gly Ala Trp Pro Met Arg Tyr 
195 200 205 

Val Pro Tyr Asn Gly Gly Ala Val Leu Pro Asp Trp Leu Pro Pro Ala 
210 215 220 

Ala Gly Arg Arg Arg Ile Ala Val Thr Leu Gly Ser Ile Asp Ala Leu
225 230 235 240 

Ser Gly Gly Ile Ala Lys Leu Ala Pro Leu Phe Ser Glu Val Ala Asp
245 250 255 

Val Asp Ala Glu Phe Val Leu Thr Leu Gly Gly Gly Asp Leu Ala Leu 
260 265 270 

Leu Gly Glu Leu Pro Ala Asn Val Pro Val Val Glu Trp Ile Pro Leu
275 280 285 

Gly Ala Leu Leu Glu Thr Cys Asp Ala Ile Ile His His Gly Gly Ser
290 295 300 

Gly Thr Leu Leu Thr Ala Leu Ala Ala Gly Val Pro Gln Cys Val Ile
305 310 315 320 

Pro His Gly Ser Tyr Gln Asp Thr Asn Arg Asp Val Leu Thr Gly Leu
325 330 335 

Gly Ile Gly Phe Asp Ala Glu Ala Gly Ser Leu Gly Ala Glu Gln Cys
340 345 350 

Arg Arg Leu Leu Asp Asp Ala Gly Leu Arg Glu Ala Ala Leu Arg Val 
355 360 365 

Arg Gln Glu Met Ser Glu Met Pro Pro Pro Ala Glu Thr Ala Ala Lys
370 375 380 

Leu Val Ala Leu Ala Gly 
385 390 

<210> 13 

<211> 275 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451 
<220> 

<223> "translate of snoVI, function: unknown"

<400> 13 

Met Thr Val Leu Val Thr Gly Ala Thr Gly Asn Val Gly Arg His Val 
1 5 10 15

Val Thr Gly Leu Leu Ala Ala Gly Arg Arg Val Arg Ala Leu Thr Arg
20 25 30 

Thr Pro Asp Arg Ser Gly Leu Pro Gly Gly Ala Glu Ile Thr Gly Gly
35 40 45 

Asp Leu Thr Arg Pro Glu Thr Tyr Glu Arg Met Leu Asp Gly Val Glu 
50 55 60 

Ala Val Tyr Leu Phe Pro Val Pro Glu Thr Ala Ala Ala Phe Ala Gly 
65 70 75 80 

Ala Ala Arg Arg Ala Gly Val Arg Arg Ile Val Val Leu Ser Ser Asp
85 90 95 

Ser Val Thr Asp Gly Thr Asp Thr Gly Gly His Arg Arg Val Glu Leu 
100 105 110 

Ala Val Glu Asp Thr Gly Leu Glu Trp Thr His Val Arg Pro Gly Glu 
115 120 125 

Phe Ala Leu Asn Lys Val Thr Leu Trp Ala Pro Ser Ile Arg Ala Glu
130 135 140 

Gly Val Val Arg Ser Ala Tyr Pro Asp Ala Arg Val Ala Pro Val His 
145 150 155 160 

Glu Ala Asp Val Ala Ala Val Ala Val Thr Ala Leu Leu Lys Glu Gly 
165 170 175 

His Ala Gly Arg Ala Tyr Ser Val Thr Gly Pro Gln Ala Leu Thr Gln
180 185 190 

Arg Glu Gln Val Arg Ala Val Gly Glu Gly Leu Gly Arg Ser Leu Ala
195 200 205 

Phe Val Glu Val Thr Pro Gly Gln Ala Arg Ala Asp Leu Thr Ala Gln
210 215 220 

Gly Leu Pro Ala Pro Ile Ala Asp Tyr Val Leu Ala Phe Gln Ala Gly
225 230 235 240 

Trp Thr Glu Arg Pro Ala Pro Ala Arg Pro Thr Val Arg Glu Val Thr 
245 250 255 

Gly Arg Pro Ala Arg Thr Leu Ala Gln Trp Ala Ala Asp His Arg Ala
260 265 270 

Asp Phe Arg 
275 

<210> 14 
<211> 424 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snogE, function: glycosyl transferase"

<400> 14 

Val Arg Val Leu Leu Thr Ser Phe Ala Met Asp Ala His Phe Cys Thr
1 5 10 15

Ala Val Pro Leu Ala Trp Ala Leu Arg Ser Ala Gly His Glu Val Arg 
20 25 30

Val Ala Gly Gln Pro Ala Leu Thr Ser Thr Ile Thr Gly Ala Gly Leu
35 40 45 

Thr Ala Val Pro Val Gly Arg Asp His Thr His Gly Ser Leu Leu Gly 
50 55 60 

Arg Val Gly Ser Asp Ile Leu Ala Leu His Asp Glu Ala Asp Tyr Leu
65 70 75 80 

Glu Ala Arg His Asp Ala Leu Gly Phe Glu Phe Leu Lys Gly His Asn 
85 90 95 

Thr Val Met Ser Ala Leu Phe Tyr Ser Gln Ile Asn Asn Asp Ser Met
100 105 110 

Val Asp Asp Leu Val Asp Phe Ala Arg His Trp Arg Pro Asp Leu Val 
115 120 125 

Val Trp Glu Pro Phe Thr Phe Ala Gly Ala Val Ala Ala Arg Ala Ser 
130 135 140 

Gly Ala Ala His Ala Arg Leu Leu Ser Phe Pro Asp Leu Phe Leu Ser
145 150 155 160 

Thr Arg Arg Leu Phe Leu Glu Arg Met Ala Arg Gln Glu Pro Glu His
165 170 175 

His Asp Asp Thr Leu Ala Glu Trp Leu Asp Trp Thr Leu Gly Arg His 
180 185 190 

Gly His Ser Phe Asp Glu Glu Ile Val Thr Gly Gln Trp Ser Ile Asp
195 200 205 

Gln Thr Pro Ala Pro Val Arg Leu Asp Ala Gly Gly Pro Thr Val Pro
210 215 220 

Met Arg Tyr Val Pro Tyr Ser Gly Leu Val Pro Thr Val Val Pro Asp
225 230 235 240 

Trp Leu Arg Arg Pro Pro Glu Arg Pro Arg Val Leu Val Thr Leu Gly 
245 250 255 

Ile Thr Ser Arg Arg Val Lys Ser Phe Leu Ala Val Ser Val Asp Asp
260 265 270 

Leu Phe Glu Ala Val Ala Gly Leu Gly Val Glu Val Val Ala Thr Leu 
275 280 285 

Asp Ala Asp Gln Arg Glu Leu Leu Gly Arg Val Pro Asp His Phe Arg
290 295 300 

Ile Val Glu His Val Pro Leu Asp Ala Val Leu Pro Thr Cys Ser Ala
305 310 315 320 

Ile Val His His Gly Gly Ala Gly Thr Trp Ser Thr Ala Ala Val Tyr
325 330 335 

Gly Val Pro Gln Val Ser Leu Gly Ser Met Trp Asp His Phe Tyr Arg
340 345 350 

Ala Arg Arg Leu Glu Glu Leu Gly Ala Gly Leu Arg Leu Pro Ser Gly 
355 360 365 

Glu Leu Thr Ala Glu Gly Leu Arg Thr Arg Leu Glu Arg Val Leu Gly 
370 375 380 
Glu Pro Ser Phe Gly Thr Ala Ala Gln Ala Leu Ser Asp Thr Ile Ala
385 390 395 400 

Ala Glu Pro Ser Pro Ser Glu Val Val Pro Val Leu Glu Glu Leu Thr 
405 410 415 

Gly Arg His Arg Pro Gly Thr Arg 
420 

<210> 15 
<211> 139 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snoL, function: unknown" 

<400> 15 

Met Ser Thr Thr Ala Asn Lys Glu Arg Cys Leu Glu Met Val Ala Ala 
1 5 10 15

Trp Asn Arg Trp Asp Val Ser Gly Val Val Ala His Trp Ala Pro Asp 
20 25 30 

Val Val His Tyr Asp Asp Glu Asp Lys Pro Val Ser Ala Glu Glu Val 
35 40 45 

Val Arg Arg Met Asn Ser Ala Val Glu Ala Phe Pro Asp Leu Arg Leu
50 55 60 

Asn Val Arg Ser Ile Val Gly Glu Gly Asp Arg Val Met Leu Arg Ile
65 70 75 80 

Thr Cys Ser Ala Thr His Gln Gly Val Phe Met Gly Ile Ala Pro Thr
85 90 95 

Gly Arg Lys Val Arg Trp Thr Tyr Leu Glu Glu Leu Arg Phe Ser Glu
100 105 110 

Ala Gly Lys Val Val Glu His Trp Asp Val Phe Asn Phe Ser Pro Leu 
115 120 125 

Phe Arg Asp Leu Gly Val Val Pro Asp Gly Leu 
130 135 

<210> 16 

<211> 155 

<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220>

<223> "translate of snoO, function: homologous to mtmX of mithramycin cluster"
<400> 16 

Met Ser Val Arg Thr Asp Gln Thr Ala Ala Pro Glu Asp Arg Ala Ala
1 5 10 15

Ala Thr Asp Pro Gly Phe Gly His Leu Tyr Ala Gln Val Gln Gln Phe
20 25 30 

Tyr Ala Arg Gln Met Gln Leu Leu Asp Ser Gly Ala Ala Glu Glu Trp
35 40 45 
Ala Ala Thr Phe Thr Glu Asp Gly Thr Phe Ala Arg Pro Ser Ser Pro
50 55 60 

Glu Pro Ala Arg Gly His Ala Glu Leu Ala Ala Gly Ala Arg Ala Ala 
65 70 75 80 

Ala Glu Arg Leu Ala Ala Glu Gly Leu Ser His Arg His Val Ile Gly
85 90 95 

Met Thr Ala Val Arg Arg Glu Pro Asp Gly Ser Val Phe Val Arg Ser 
100 105 110 

Tyr Ala Gln Val Phe Ala Thr Arg Arg Gly Glu Ala Pro Arg Leu His
115 120 125 

Leu Ile Cys Val Cys Glu Asp Val Leu Val Arg Glu Gly Pro Gly Leu
130 135 140 

Lys Val Arg Glu Arg Val Val Thr His Asp Ala 
145 150 155 

<210> 17 
<211> 281 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snoaF, function: C-7 ketoreductase"

<400> 17 

Val Arg Ala Met Thr Asp Ser Thr Gly Pro Arg Pro Val Pro Ala Met 
1 5 10 15

Ser Pro Ala Pro Ser Pro Thr Pro Ser Pro Gly Pro Ala Pro Gly Ser 
20 25 30 

Glu Pro Ala Pro Leu Ala Val Ile Val Thr Gly Gly Gly Ser Gly Ile
35 40 45 

Gly Arg Ala Thr Ala Arg Ala Phe Ala Ala Gln Gly Ala Lys Val Leu
50 55 60 

Val Val Gly Arg Thr Glu Asp Ala Leu Ala Gln Thr Ala Glu Gly Cys
65 70 75 80 

Ala Asp Met Arg Val Leu Val Ala Asp Val Ala Ser Pro Asp Gly Pro 
85 90 95 

Gln Ala Val Val Asn Ala Ala Leu Arg Glu Phe Gly Arg Ile Asp Val
100 105 110 

Leu Val Asn Asn Ala Ala Val Ala Gly Met Glu Thr Leu Gln Thr Val
115 120 125 

Asp Arg Asp Ala Val Ala Arg Gln Phe Gly Thr Asn Leu Thr Ala Pro
130 135 140 

Leu Phe Leu Val Gln Ser Ala Leu Gly Ala Leu Glu Lys Ser Arg Gly
145 150 155 160 

Ile Val Val Asn Val Gly Thr Ala Ala Thr Leu Gly Leu Arg Ala Ala
165 170 175 

Pro Thr Gly Ala Leu Tyr Gly Ala Ser Lys Val Ala Leu Asp Tyr Leu 
180 185 190 
Thr Arg Thr Trp Ala Val Glu Leu Ala Pro Arg Gly Ile Arg Val Val
195 200 205 

Gly Val Ala Pro Gly Val Ile Asp Thr Gly Ile Gly Val Arg Met Gly
210 215 220 

Met Thr Pro Glu Gly Tyr Arg Glu Phe Leu Thr Gly Met Gly Gly Arg 
225 230 235 240 

Val Pro Val Gly Arg Val Gly Arg Pro Glu Asp Val Ala Trp Trp Ile
245 250 255 

Val Gln Leu Ala Arg Pro Glu Ala Gly Tyr Ala Thr Gly Met Val Val
260 265 270 

Pro Val Asp Gly Gly Leu Ser Leu Val 
275 280 

<210> 18 
<211> 190 
<212> PRT 

<213> Streptomyces nogalater ATCC 27451 

<220> 

<223> "translate of snoU, function: unknown" 

<400> 18 

Val Gln Glu Thr Glu Pro Gly Val Pro Ala Asp Leu Pro Ala Glu Ser
1 5 10 15

Asp Pro Ala Ala Leu Glu Arg Leu Ala Ala Arg Tyr Arg Arg Asp Gly 
20 25 30 

Tyr Val His Val Pro Gly Val Leu Asp Ala Gly Glu Val Ala Glu Tyr 
35 40 45 

Leu Ala Glu Ala Arg Arg Leu Leu Ala His Glu Glu Ser Val Arg Trp 
50 55 60 

Gly Ser Gly Ala Gly Thr Val Met Asp Tyr Val Ala Asp Ala Gln Leu
65 70 75 80 

Gly Ser Asp Thr Met Arg Arg Leu Ala Thr His Pro Arg Ile Ala Ala
85 90 95

Leu Ala Glu Tyr Leu Ala Gly Ser Pro Leu Arg Leu Phe Lys Leu Glu 
100 105 110 

Val Leu Leu Lys Glu Asn Lys Glu Lys Asp Ala Ser Val Pro Thr Ala 
115 120 125 

Pro His His Asp Ala Phe Ala Phe Pro Phe Ser Thr Ala Gly Thr Ala 
130 135 140 

Leu Thr Ala Trp Val Ala Leu Val Asp Val Pro Val Glu Arg Gly Cys 
145 150 155 160 

Met Thr Phe Val Pro Gly Ser His Leu Leu Pro Asp Pro Asp Thr Gly 
165 170 175 

Asp Glu Pro Trp Ala Gly Ala Phe Thr Arg Pro Gly Glu Ile
180 185 190 



